Applying the OntoLex Model to a Multilingual Terminological Resource

Authors

  • Julia Bosque-Gil
  • Jorge Gracia
  • Guadalupe Aguado de Cea
  • Elena Montiel-Ponsoda
Abstract

Terminesp is a multilingual terminological resource with terms from a range of specialized domains. Along with definitions, notes, scientific denominations and provenance information, it includes translations from Spanish into a variety of languages. A linked data resource with these features would represent a potentially relevant source of knowledge for NLP-based applications. In this contribution we show that Terminesp constitutes an appropriate validating test bench for OntoLex and its vartrans module, a newly developed model which evolves the lemon model to represent the lexicon-ontology interface. We present a first showcase of this module to account for variation across entries, while highlighting the modeling problems we encountered in this effort. Furthermore, we extend the resource with part-of-speech and syntactic information, which was not explicitly declared in the original data, with the aim of exploring its future use in NLP applications.

Similar resources

GRISP: A Massive Multilingual Terminological Database for Scientific and Technical Domains

The development of a multilingual terminology is a very long and costly process. We present the creation of a multilingual terminological database called GRISP covering multiple technical and scientific fields from various open resources. A crucial aspect is the merging of the different resources which is based in our proposal on the definition of a sound conceptual model, different domain mapp...

Proposing an OntoLex-MMoOn Alignment: Towards an Interconnection of two Linguistic Domain Models

This paper motivates and proposes to align the OntoLex and MMoOn Core models. It deals in particular with the ontolex and decomp modules and their potential as ontological foundation to represent the domain of morphological language data (MLD). It will be argued that ontolex and decomp provide only a basic modelling of the domain, which is not sufficient for representing fine-grained MLD, but s...

Towards the Integration of Multilingual Terminologies: an Example of a Linked Data Prototype

Many language resources are nowadays available in machine readable formats, but still contained in isolated silos. Current Semantic Web-based techniques enable the transformation and linking of those resources to become a navigable graph of linked language resources, which can be directly consumed by third-party applications. The prototype we have developed builds on a web user interface and SP...

Towards Multilingual eLexicography by Means of Linked (Open) Data

In this short paper, we document the current state of work consisting in mapping various lexicographic resources onto the OntoLex model, which is an OWL and RDF(s) based representation format. This model has been designed in the context of a W3C Community Group effort for supporting the publication of linguistic data in the Linked (Open) Data cloud. The deployment of OntoLex is currently being ...

Building Specialized Multilingual Lexical Graphs Using Community Resources

We are describing methods for compiling domain-dedicated multilingual terminological data from various resources. We focus on collecting data from online community users as a main source, therefore, our approach depends on acquiring contributions from volunteers (explicit approach), and it depends on analyzing users’ behaviors to extract interesting patterns and facts (implicit approach). As a ...

Publication date: 2015